home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
ftp.cs.arizona.edu
/
ftp.cs.arizona.edu.tar
/
ftp.cs.arizona.edu
/
icon
/
newsgrp
/
group98c.txt
/
000021_icon-group-sender _Mon Sep 14 08:23:43 1998.msg
< prev
next >
Wrap
Internet Message Format
|
2000-09-20
|
2KB
Return-Path: <icon-group-sender>
Received: from kingfisher.CS.Arizona.EDU (kingfisher.CS.Arizona.EDU [192.12.69.239])
by baskerville.CS.Arizona.EDU (8.9.1a/8.9.1) with SMTP id IAA06280
for <icon-group-addresses@baskerville.CS.Arizona.EDU>; Mon, 14 Sep 1998 08:23:42 -0700 (MST)
Received: by kingfisher.CS.Arizona.EDU (5.65v4.0/1.1.8.2/08Nov94-0446PM)
id AA01577; Mon, 14 Sep 1998 08:23:15 -0700
To: icon-group@optima.CS.Arizona.EDU
Date: 12 Sep 1998 07:30:31 GMT
From: MUSKB@luff.latrobe.edu.au (Bastin,kim)
Message-Id: <6td7un$cpr$2@news.latrobe.edu.au>
Organization: La Trobe University
Sender: icon-group-request@optima.CS.Arizona.EDU
References: <4FD6422BE942D111908D00805F3158DF0757B57E@RED-MSG-52>
Subject: Re: Unicode support or support for non-ASCII based character ma
Errors-To: icon-group-errors@optima.CS.Arizona.EDU
Status: RO
Todd Proebsting (toddpro@microsoft.com) wrote:
: Jcon (http://www.cs.arizona.edu/icon/jcon) originally used Java's Unicode
: strings as the basis for implementing Icon strings. This was trivial to do,
: and we (Gregg Townsend and I) would have kept that implementation except
: that it was much too slow. I blame this in large part on the Java "String"
: implementation, and not on Unicode. (Although Unicode obviously wastes a
: lot of space in applications that are just using the ASCII subset.)
: Prior to switching to ASCII, we never faced the problem of what was the
: "right" thing to do with pre-defined csets with Unicode. For instance, what
: are the elements of the cset &ucase in an Unicode world?
: Todd Proebsting
Icon's &ucase (A-Z) is based on the English alphabet. Other European
languages would add @ADCG... etc. But it doesn't stop there.
Whether you have upper vs lowercase at all depends on the alphabet.
Not all alphabets have such a distinction. Some have other distinctions
that are novel to users of the Roman alphabet. For example, Greek,
Hebrew and Arabic have special 'final' forms of some characters (used at
the end of a word).
Kim Bastin